Conversation

@chai-xiaonan
Contributor

The checkpoint conversion tool must be updated accordingly following the Megatron update.
Errors occurred when converting a trained torch-format checkpoint to the HF safetensors format:
1) The parameter args.expert_tensor_parallel_size cannot be retrieved.
2) model_provider is missing the model_builder parameter.
Fixed the two issues mentioned above.
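A minimal sketch of a backward-compatible way to handle the two failure modes described above. The helper names below are hypothetical illustrations, not the actual patch; only the attribute name `expert_tensor_parallel_size` and the parameter name `model_builder` come from the report:

```python
import inspect


def get_arg(args, name, default=None):
    # Fall back to a default when an attribute (e.g.
    # expert_tensor_parallel_size) is absent from an older
    # checkpoint's saved args namespace.
    return getattr(args, name, default)


def call_model_provider(model_provider, **kwargs):
    # Pass optional keyword arguments (e.g. model_builder) only if the
    # provider's signature accepts them, so the conversion tool keeps
    # working across Megatron-LM versions.
    params = inspect.signature(model_provider).parameters
    accepted = {k: v for k, v in kwargs.items() if k in params}
    return model_provider(**accepted)
```

With this pattern, a provider that predates the `model_builder` parameter is still callable, and a missing parallelism attribute degrades to a sensible default instead of raising `AttributeError`.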

@CLAassistant

CLAassistant commented Nov 27, 2025

CLA assistant check
All committers have signed the CLA.

@lxd-cumt changed the title from "update_conversion_tool" to "[Train] Sync checkpoint conversion tools to the newest third_patry/Megatron-LM" on Dec 1, 2025
@lxd-cumt changed the title from "[Train] Sync checkpoint conversion tools to the newest third_patry/Megatron-LM" to "[Train] Sync checkpoint conversion tools to the newest third_party/Megatron-LM" on Dec 1, 2025
…L file, and resolved the absence of the tokenizer_type parameter in the Aquila model conversion tool.